# **AgensGraph Quick Guide**
## **Copyright Notice**
Copyright © 2016-2024, Bitnine Inc. All Rights Reserved.
Restricted Rights Legend
PostgreSQL is Copyright © 1996-2024 by the PostgreSQL Global Development Group.
Postgres95 is Copyright © 1994-5 by the Regents of the University of California.
AgensGraph is Copyright © 2016-2024 by Bitnine Inc.
Permission to use, copy, modify, and distribute this software and its documentation for any purpose, without fee, and without a written agreement is hereby granted, provided that the above copyright notice and this paragraph and the following two paragraphs appear in all copies.
IN NO EVENT SHALL THE UNIVERSITY OF CALIFORNIA BE LIABLE TO ANY PARTY FOR DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES, INCLUDING LOST PROFITS, ARISING OUT OF THE USE OF THIS SOFTWARE AND ITS DOCUMENTATION, EVEN IF THE UNIVERSITY OF CALIFORNIA HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
THE UNIVERSITY OF CALIFORNIA SPECIFICALLY DISCLAIMS ANY WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE SOFTWARE PROVIDED HEREUNDER IS ON AN ”AS-IS” BASIS, AND THE UNIVERSITY OF CALIFORNIA HAS NO OBLIGATIONS TO PROVIDE MAINTENANCE, SUPPORT, UPDATES, ENHANCEMENTS, OR MODIFICATIONS.
**Trademarks**
AgensGraph® is a registered trademark of Bitnine Global Inc. Other products, titles or services may be registered
**Open Source Software Notice**
Some modules or files of this product are subject to the terms of the following licenses. : OpenSSL, RSA Data Security, Inc., Apache Foundation, Jean-loup Gailly and Mark Adler, Paul Hsieh’s hash.
**Information of technical documentation**
Title : AgensGraph Quick Guide
Published date : 11 10, 2022
S/W version : AgensGraph v2.14.0
Technical documentation version : v1.0
# **Release Notes**
This document is for AgensGraph v2.14 community edition. See the release [link](https://github.com/bitnine-oss/agensgraph/releases).
# **Overview**
## **Basic information**
**About the Documentation**
This document is a short guide for AgensGraph developers. This guide will introduce various aspects of AgensGraph, including the data model and data definition language, Cypher query processing abilities, and Java development capabilities. Because AgensGraph is compatible with PostgreSQL, some parts of the documentation quote parts of PostgreSQL’s documentation or link to it.
**License**
Creative Commons 3.0 License ()
## **AgensGraph Introduction**
### **What is AgensGraph?**
AgensGraph is a new generation multi-model graph database for the modern complex data environment. AgensGraph is a multi-model database based on PostgreSQL RDBMS, and supports both relational and graph data models at the same time. This enables developers to integrate the legacy relational data model and the flexible graph data model in one database. AgensGraph supports Ansi-SQL and Open Cypher ([http://www.opencypher.org](http://www.opencypher.org/)). SQL queries and Cypher queries can be integrated into a single query in AgensGraph.
AgensGraph is very robust, fully-featured and ready for enterprise use. It is optimized for handling complex connected graph data and provides plenty of powerful database features essential to the enterprise database environment including ACID transactions, multi-version concurrency control, stored procedure, triggers, constraints, sophisticated monitoring and a flexible data model (JSON). Moreover, AgensGraph leverages the rich eco-systems of PostgreSQL and can be extended with many outstanding external modules, like PostGIS.
## **Features**
- Multi-model support of the property graph data model, relational data model and JSON documents
- Cypher query language support
- Integrated querying using SQL and Cypher
- Graph data object management
- Hierarchical graph label organization
- Property indexes on both vertexes and edges
- Constraints: unique, mandatory and check constraints
- ACID transactions
- Hadoop connectivity
- Active-standby replication
- And many other features
### **More Information**
Because AgensGraph inherits all features of PostgreSQL ([http://postgresql.org](http://postgresql.org/)), users and database administrators can refer to [the documentation of PostgreSQL](https://www.postgresql.org/docs/current/static/index.html) for more information.
# **Installation**
AgensGraph runs on Linux and Windows. There are two methods available to install AgensGraph: downloading the binary package or compiling the package from source code.
## **Installation of pre-built packages**
### **Installing AgensGraph on Linux**
1. Get the pre-compiled binary:
Visit the AgensGraph [download page](https://bitnine.net/downloads/) and download the corresponding version of AgensGraph.
```
*Tip: If you do not know your system environment, you can use the command:
uname -sm*
```
2. Extract the package:
Extract the downloaded file into a directory for your use (for example, /usr/local/AgensGraph/ on Linux)
tar xvf */path/to/your/use*
```
*Note : If you want AgensGraph on other operating systems, please contact Bitnine’s support team.*
```
## **Installation of build source code**
1. Access the AgensGraph[github](https://github.com/bitnine-oss/agensgraph) and get the source code.
$ git clone
2. Install the following essential libraries according to each OS.
1. CENTOS
$ yum install gcc glibc glib-common readline readline-devel zlib zlib-devel flex bison
2. Fedora
$ dnf install gcc glibc bison flex readline readline-devel zlib zlib-devel
3. Ubuntu
$ sudo apt-get install build-essential libreadline-dev zlib1g-dev flex bison
3. Go to the clone location and run configure on the source tree. The –prefix=/path/to/intall option allows you to set the location where AgensGraph will be installed.
$ ./configure
4. Run the build.
$ make install
5. Install the extension module and binary
$ make install-world
## **Post-Installation Setup and Configuration**
1. Environment variable setting (optional):
You can add these commands into a shell start-up file, such as the `~/.bash\_profile`.
```
*export LD\_LIBRARY\_PATH=/usr/local/AgensGraph/lib:$LD\_LIBRARY\_PATH
export PATH=/usr/local/AgensGraph/bin:$PATH
export AGDATA=/path/to/make/db\_cluster*
```
2. Creating a database cluster\*:
```
*initdb [-D /path/to/make/db\_cluster]*
```
3. Starting the server\*\*:
```
*ag\_ctl start [-D /path/created/by/initdb]*
```
4. Creating a database\*\*:
```
*createdb [dbname]*
```
If dbname is not specified, a database with the same name as the current user is created, by default.
5. Execute the interactive terminal:
```
*agens [dbname]*
```
If the db\_cluster directory is not specified with `-D` option, the environment variable `AGDATA` is used.
## **Configuring Server Parameters**
In order to attain optimal performance, it is very important to set server parameters correctly according to the size of data and machine resources. Among many server parameters, the following parameters are crucial for AgensGraph graph query performance. (You can edit `$AGDATA/postgresql.conf` file to set these parameters (restart required)).
- shared\_buffers: The size of memory for caching data objects. This parameter should be increased for the production environment. It is optimal when it is as large as the data size. But, this parameter should be set carefully considering concurrent sessions and memory size allocated for each queries. The recommended setting is half of the physical memory size.
- work\_mem: This shoud be also increased according to the size of physical memory and the properties of queries that will be executed carefully.
- random\_page\_cost: This parameter is for query optimization. For graph queries, it is recommended to reduce this value to 1 or 0.005 (in case graph data is fully cached in memory).
For more information, you can refer to [PostgreSQL documentation](https://www.postgresql.org/docs/current/static/runtime-config-resource.html).
# **Data Model**
## **AgensGraph Data Model**
AgensGraph is a multi-model database. AgensGraph simultaneously supports both the property graph model and the relational model.
### **Property Graph Model**

*Labeled Property Graph Model*
The property graph model contains connected entities, which can have any number of attributes. In AgensGraph, an entity is known as a vertex. Vertices can have an arbitrary number of attributes and can be categorized with labels. Labels are used to group vertices in order to represent some categories of vertices; i.e. representing the role of a person.
Edges are directed connections between two vertices. Edges can also have attributes and categorized labels like vertices. In AgensGraph, an edge always has a start vertex and an end vertex. If a query tries to delete a vertex, it must delete all its edges first. Broken edges cannot exist in AgensGraph.
Properties of edges and vertices are represented in the JSON format. JSON is a text format for the serialization of semi-structured data. JSONs are comprised of six data types: strings, numbers, booleans, null, objects and arrays. AgensGraph objects take full advantage of the JSON format by storing information as unordered collections of zero or more name/value pairs. A name is a string and a value can be any aforementioned type, including nested JSON types. AgensGraph specifically uses the JSONB format. Since JSONB is a decomposed binary format, it is processed much faster than regular JSON, but at the cost of a slightly slower input time.
### **Data Objects in AgensGraph**

*AgensGraph Simple Data model*
In AgensGraph, several databases can be created and each database can contain one or more schemas and graphs. Schemas are for relational tables, and graph objects are for graph data. Schema name and graph name can not be the same. Vertices and edges are grouped into labels. There are two kinds of labels: vertex labels and edge labels. Users can create several graphs in a database but only one graph could be used at one time.
### **Labels**
Labels are used to group vertices and edges. Users can create property indexes for all vertices under a given label. Labels can be used to provide access controls for different types of users, and label hierarchies can be created to add inheritance to labels. There is default labels for vertices: ag\_vertex. If one creates a vertex without specifying its label, then the vertex is stored in the default label. While An edge always has one label.
We call vertex label and edge label as VLABEL and ELABEL respectively.
- VLABEL : The vertex label. Categorizes vertices to represent their roles.
- Vertex : entities which can hold attributes.
- ELABEL : The edge label. Categorizes edges to represent their roles.
- Edge : relationships connecting entities.

*Edge Label inheritance example*
Every label inherits one or more labels. The figure above shows an example hierarchy of edge labels. The label hierachy is similar to a class hierarchy in object-oriented programming. Each parent label contains child label data. For example, given the above hierarchy, if a query matches with the edge ‘friends’ then, the results contain the data of the ‘roommate’ label.
## **Data Definition Language**
This section introduces DDLs for graph objects with a few examples.
### **Quick Description**
#### **Graph**
To create a graph, use the CREATE GRAPH command.
***CREATE***
```
CREATE GRAPH graphname;
```
Several graphs can be created in a database. In order to specify which graph is to be used, the session parameter *graph\_path* is used.
To show the current graph path, use the following command.
```
SHOW graph\_path;
```
When a graph is created using `CREATE GRAPH`, *graph\_path* is set to the created graph if *graph\_path* is not set. You can create multiple graphs and change *graph\_path* to another graph using the following command:
```
SET graph\_path = graphname;
```
The *graph\_path* is a session variable, so every client must set the *graph\_path* before querying the graph. Only one graph name can be specified for *graph\_path*. Querying over multiple graphs is not allowed.
If you set the *graph\_path* for each user or database using the `ALTER ROLE` or `DATABASE` statement, you do not need to run the `SET graph\_path` statement whenever you connect the database.
```
ALTER ROLE user1 IN DATABASE gdb SET graph\_path TO graphname;
ALTER DATABASE gdb SET graph\_path TO graphname;
```
***DROP***
```
DROP GRAPH graphname CASCADE;
```
A graph has initial labels for vertices and edges. These labels cannot be deleted. To drop the graph, users must do so with the `CASCADE` option. If current *graph\_path* is the deleted graph, then *graph\_path* is reset to null.
#### **Labels**
***CREATE***
```
CREATE VLABEL person;
CREATE VLABEL friend inherits (person);
CREATE ELABEL knows;
CREATE ELABEL live\_together;
CREATE ELABEL room\_mate inherits (knows, live\_together);
```
The keywords VLABEL and ELABEL are used for identifying vertices and edges respectively. CREATE VLABEL will create a vertex label. VLABEL can inherit VLABEL only (in other words, VLABEL cannot inherit ELABEL). `inherits` is an option to inherit a parent label. If not specified, the system sets the initial label as a parent label. Multiple inheritance is possible for creating complex labels.
***DROP***
```
DROP VLABEL friend;
DROP VLABEL person;
```
VLABEL friend inherits person, so VLABEL person cannot be dropped directly. VLABEL friend should be dropped first.
If a label is bound to any vertex or edge, you cannot delete this label by using DROP VLABEL or DROP ELABEL.
```
DROP VLABEL person;
ERROR: cannot drop person because it is not empty.
```
You can use `DROP ... CASCADE` to delete a label regardless of whether it is bound to or not, and in this case, all data object that has this label is deleted along with it.
```
DROP VLABEL person CASCADE;
NOTICE: drop cascades to vlabel friend
DROP VLABEL
DROP ELABEL knows CASCADE;
```
### **Detailed Description**
#### **GRAPH**
***CREATE GRAPH***
```
CREATE GRAPH [ IF NOT EXISTS ] graph\_name [AUTHORIZATION role\_name];
```
- IF NOT EXISTS
- Do nothing if the same name already exists
- AUTHORIZATION role\_name
- The role name of the user who will own the new graph
***ALTER GRAPH***
```
ALTER GRAPH graph\_name RENAME TO new\_name;
ALTER GRAPH graph\_name OWNER TO { new\_owner | CURRENT\_USER | SESSION\_USER };
```
- graph\_name
- The name of an existing graph.
- RENAME TO new\_name
- This form changes the name of a graph to new\_name.
- OWNER TO new\_owner
- This form changes the owner of the graph.
#### **LABEL**
***CREATE LABEL***
The synopsis of both VLABEL and ELABEL is identical.
```
CREATE [ UNLOGGED ] VLABEL [ IF NOT EXISTS ] label\_name [DISABLE INDEX]
[ INHERITS ( parent\_label\_name [, ...] ) ]
[ WITH (storage\_parameter)]
[ TABLESPACE tablespace\_name ];
```
- UNLOGGED
- Data written to an unlogged label is not recorded to the write-ahead log, which makes unlogged labels considerably faster than logged labels. However, unlogged labels are not crash-safe.
- IF NOT EXISTS
- Do nothing if the same name already exists.
- label\_name
- The name of the vertex/edge label to be created.
- DISABLE INDEX
- Create a label with invalid index. The invalid indexes can not be used for searching or inserting until reindexed.
- INHERITS ( parent\_label [, …] )
- The optional INHERITS clause specifies a list of vertex/edge labels. If it is empty, the new label inherits the initial label. Use of INHERITS creates a persistent edge between the new child label and its parent label(s). The data of the child label is included in scans of the parent(s) by default.
- TABLESPACE tablespace\_name
- The new label will be created in the tablespace whose name is tablespace\_name.
***ALTER LABEL***
```
ALTER [ IF EXISTS ] VLABEL label\_name RENAME TO new\_name;
ALTER [ IF EXISTS ] VLABEL label\_name OWNER TO new\_owner;
ALTER [ IF EXISTS ] VLABEL label\_name SET STORAGE { PLAIN | EXTERNAL | EXTENDED | MAIN };
ALTER [ IF EXISTS ] VLABEL label\_name SET TABLESPACE new\_tablespace;
ALTER [ IF EXISTS ] VLABEL label\_name CLUSTER ON idxname;
ALTER [ IF EXISTS ] VLABEL label\_name SET WITHOUT CLUSTER;
ALTER [ IF EXISTS ] VLABEL label\_name SET LOGGED;
ALTER [ IF EXISTS ] VLABEL label\_name SET UNLOGGED;
ALTER [ IF EXISTS ] VLABEL label\_name INHERIT parent\_label;
ALTER [ IF EXISTS ] VLABEL label\_name NO INHERIT parent\_label;
ALTER [ IF EXISTS ] VLABEL label\_name DISABLE INDEX;
```
- IF EXISTS
- Do not throw an error if the label does not exist.
- label\_name
- The name of an existing vertex/edge label.
- RENAME TO new\_name
- Changes the name of a label to new\_name.
- OWNER TO new\_owner
- Changes the owner of the label.
- SET STORAGE
- This form sets the storage mode for property. This controls whether property is held inline or in a secondary TOAST table, and whether the data should be compressed or not.
- PLAIN must be used for fixed-length, inline and uncompressed values.
- MAIN is inline and compressed.
- EXTERNAL is external and uncompressed.
- EXTENDED is external and compressed.
- SET TABLESPACE
- This form changes the label’s tablespace to the specified tablespace and moves the data files to the new tablespace. Indexes on the label are not moved.
- CLUSTER/WITHOUT CLUSTER
- This form select/remove the default index for future cluster operation. [See this](https://www.postgresql.org/docs/current/static/sql-cluster.html).
- SET LOGGED/UNLOGGED
- This form changes the label from unlogged to logged or vice-versa.
- INHERIT/NO INHERIT
- This form adds/removes the target label to/from the parent label’s children list.
- DISABLE INDEX
- This form changes all indexes of the label to invalid index. The invalid indexes can not be used for searching or inserting until reindexed.
#### **Property Index**
The property index provides a method to build indexes on the property values. Users can also create an index for an expression.
***CREATE INDEX***
```
CREATE [UNIQUE] PROPERTY INDEX [CONCURRENTLY] [IF NOT EXIST] indexname
ON labelname [USING method]
( attribute\_expr | (expr) [COLLATE collation] [opclass] [ASC | DESC]
[NULLS {FIRST | LAST}] )
[WITH] (storage\_parameter = value [,...])
[TABLESPACE tablespacename]
[WHERE predicate];
```
- UNIQUE
- Causes the system to check for duplicate values in the table when the index is created if the data already exists and each time data is added. Attempts to insert or update data which would result in duplicate entries will generate an error.
- CONCURRENTLY
- With this option, the index will be built without taking any locks that prevent concurrent
CREATE, DELETE(DETACH), SET on the label. There are several caveats to be aware of when using this option. [See this](https://www.postgresql.org/docs/9.5/static/sql-createindex.html#SQL-CREATEINDEX-CONCURRENTLY).
- IF NOT EXIST
- Do nothing if the same name already exists
- indexname
- The name of the index to be created. If it is omitted, AgensGraph chooses an adequate name.
- labelname
- The name of the label to be indexed.
- USING METHOD
- The name of the index method to be used. They are btree, hash, gist, spgist, gin and brin.
The default method is btree.
- attribute\_expr
- An expression which is pointing to the attribute of property.
- COLLATE collation
- The name of the collation to use for the index. By default, the index uses the collation declared for the column to be indexed or the result collation of the expression to be indexed. Indexes with non-default collations can be useful for queries that involve expressions using non-default collations.
- opclass
- The name of an operator class.
- ASC | DESC
- Specified ascending/descending sort order. ASC is the default.
- NULLS { FIRST | LAST }
- Specifies that nulls sort before/after non-nulls. LAST is default when ASC is specified.
- WITH storage\_parameter
- The name of an index-method-specific storage parameter. See [Index Storage Parameters](https://www.postgresql.org/docs/current/static/sql-createindex.html#SQL-CREATEINDEX-STORAGE-PARAMETERS) for details.
- TABLESPACE tablespacename
- The tablespace in which to create the index.
- WHERE predicate
- The constraint expression for a partial index.
***DROP INDEX***
```
DROP PROPERTY INDEX indexname [CASCADE | RESTRICT];
```
- indexname
- The indexname can be checked with `ag\_property\_indexes` or `\dGi`.
- CASCADE
- Automatically drop objects that depend on the graph.
- RESTRICT
- Refuse to drop the index if any objects depend on it. This is the default.
#### **Constraints**
This section provides information about how to use constraints on properties. Users can create two types of constraints; UNIQUE constraint and CHECK constraint.
```
CREATE CONSTRAINT constraint\_name ON label\_name ASSERT field\_expr IS UNIQUE;
CREATE CONSTRAINT constraint\_name ON label\_name ASSERT check\_expr;
DROP CONSTRAINT constraint\_name ON label\_name;
```
- field\_expr
- This form represents a JSON expression. This JSON can be a nested JSON.
- UNIQUE
- This form specified that json\_expression can contain only unique values.
- check\_expr
- The check expression returns a boolean result for new or updated properties. The result must be TRUE or UNKNOWN for an insert or update to succeed. Should any property of an insert or update operation produce a FALSE result, an error exception is raised and the insert or update will rolled back.
# **AgensGraph Query**
## **Graph Query**
### **Introduction**
To retrieve and manipulate graph data, AgensGraph supports the Cypher query language. Cypher is a declarative language similar to SQL. Cypher is easy to learn since its syntax visually describes the patterns found in graphs.
This guide briefly explains how to write Cypher queries using an example graph.
### **Creating an Example Graph**
AgensGraph can store multiple graphs in a single database. However, Cypher has no way of discerning multiple graphs. Therefore, AgensGraph supports additional Data Definition Languages and variables to create and manage graphs using Cypher.
The following statements create a graph called *network* and set it as a current graph.
```
**CREATE** GRAPH **network**;
**SET** graph\_path = **network**;
```
In this example, the *graph\_path* variable is explicitly set to *network*. However, if *graph\_path* is not set before creating the graph, it will be set automatically after creating the graph.
#### **Creating the Labels**
Before creating graph data, generating a label is basic. Although this is the default, label is generated automatically when you specify label in the CREATE statement in cypher(VLABEL/ELABEL can both be created).
```
*note: Be careful not to confuse the label with the lable. Unintentional new labels can be created.*
```
All graph elements have one label. For vertex, if no label is specified, have `ag\_vertex` as a default label. For edge, the label can not be omitted. `ag\_edge` label also exists but is used for other purposes.
AgensGraph supports DDL’s to create such labels.
The following statements create a vertex label *person* and an edge label *knows*.
```
**CREATE** VLABEL person;
**CREATE** ELABEL knows;
**CREATE** (n:movie {title:'Matrix'});
```
#### **Creating the Vertices and Edges**
Now, we can create vertices for *person* and edges for *knows* by using Cypher’s `CREATE` clause. The `CREATE` clause creates a pattern that consists of vertices and edges. A vertex has the form: `(variable:label {property: value, ...})`, and an edge has: `-[variable:label {property: value, ...}]-`. An additional `<` on the leftmost side or `>` on the rightmost side is used to denote the direction of the edge. `variable` can be omitted if created vertices and edges are not to be referenced.
>*note: AgensGraph does not support -- grammar for an edge in a pattern because -- means a comment to the end of the line.*
The following statements create three simple patterns: “Tom knows Summer”, “Pat knows Nikki” and “Olive knows Todd”.
```
CREATE (:person {name: 'Tom'})-[:knows {fromdate:'2011-11-24'}]->(:person {name: 'Summer'});
CREATE (:person {name: 'Pat'})-[:knows {fromdate:'2013-12-25'}]->(:person {name: 'Nikki'});
CREATE (:person {name: 'Olive'})-[:knows {fromdate:'2015-01-26'}]->(:person {name: 'Todd'});
MATCH (p:Person {name: 'Tom'}),(k:Person{name: 'Pat'})
CREATE (p)-[:KNOWS {fromdate:'2017-02-27'} ]->(k);
```
To store properties of vertices and edges, AgensGraph uses PostgreSQL’s *jsonb* type. Properties can have nested JSON objects as their values. Since AgensGraph uses PostgreSQL’s type system, any data type supported by PostgreSQL can be stored into the properties of vertices and edges.
### **Querying the Graph**
Let’s retrieve the pattern we created above. Cypher has the `MATCH` clause to find a pattern in a graph.
The following statement finds the pattern, “A person called Tom knows a person”.
MATCH (n:person {name: 'Tom'})-[:knows]->(m:person) **RETURN** n.name **AS** n, m.name **AS** m;
```
n | m
-------+----------
"Tom" | "Summer"
"Tom" | "Pat"
(2 **rows**)
```
Since properties are of the *jsonb* type, we need methods to access their property values. PostgreSQL supports those methods through operators such as `->`, `->>`, `#>`, and `#>>`. If a user wants to access the property *name* of vertex *m*, one can write `(m)->>name`. AgensGraph offers an alternate way to access these elements. AgensGraph uses the dot operator . and bracket operators `[]` on vertices and edges to access property values in JSON objects and elements in JSON arrays as shown above.
The `RETURN` clause returns variables and its properties as a result of the query. The result is a table which has multiple matched patterns in its rows.
***Variable Length Edges***
Let’s consider a query that finds knows of ‘Tom’ and knows of knows. We can use the `UNION` clause:
```
MATCH (p:person {name: 'Tom'})-[:knows]->(f:person)
RETURN f.name
UNION ALL
MATCH (p:person {name: 'Tom'})-[:knows]->()-[:knows]->(f:person)
RETURN f.name;
```
It can also be written as:
```
MATCH (p:person {name: 'Tom'})-[r:knows\*1..2]->(f:person)
RETURN f.name, r[1].fromdate;
```
A query looking for vertices located after a variable length of edge-vertex paths is typical in graph databases. `*1..2` used in the edge represents such a variable-length edge. Where `1` is the minimum length of the edge and `2` is the maximum length. If you do not specify a value, the default range values are 1 and infinity.
You can also use update clauses such as `CREATE`, `SET`, `REMOVE` and `DELETE` after `MATCH` clauses. In the next section, we will see how data in a graph can be modified and deleted.
### **Manipulating the Graph**
You can set properties on vertices and edges using the `SET` clause. If you set a null value to a property, the property will be removed.
The following statement finds the given pattern and updates the property in the matched edge.
```
MATCH (:person {name: 'Tom'})-[r:knows]->(:person {name: 'Summer'})
SET r.since = '2009-01-08';
```
To delete vertices and edges in a graph, we can use the DELETE clause.
The following statement finds a vertex and deletes it.
```
MATCH (n:person {name: 'Pat'}) DETACH DELETE (n);
```
The above example actually uses the `DETACH DELETE` clause to delete vertices and edges attached to the vertices all at once.
The final shape of the graph *network* is as follows:
```
MATCH (n)-[r]->(m) RETURN n.name AS n, properties(r) AS r, m.name AS m;
n | r | m
---------+---------------------------------------------------+----------
"Tom" | {"since": "2009-01-08", "fromdate": "2011-11-24"} | "Summer"
"Olive" | {"fromdate": "2015-01-26"} | "Todd"
(2 rows)
```
### **MERGE**
If you need to ensure that a pattern exists in the graph, you can use `MERGE`. It will try to find the pattern if the pattern exists in the graph, or else it creates the pattern if it does not exist. If the pattern exists, it is treated like a `MATCH` clause, otherwise it is treated like a `CREATE` clause. `MERGE` is a `MATCH` or `CREATE` of the entire pattern. This means that if any element of the pattern does NOT exist, AgensGraph will create the entire pattern.
The following statement guarantees that everyone’s city exists in the graph.
```
CREATE VLABEL customer;
CREATE VLABEL city;
CREATE (:customer {name:'Tom', city:'santa clara'}),
(:customer {name:'Summer ', city:'san jose'}),
(:customer {name:'Pat', city:'santa clara'}),
(:customer {name:'Nikki', city:'san jose'}),
(:customer {name:'Olive', city:'san francisco'});
MATCH (a:customer)
MERGE (c:city {name:a.city});
MATCH (c:city) **RETURN** properties(c);
properties
-------------------------
{"name": "santa clara"}
{"name": "san jose"}
{"name": "san francisco"}
(3 rows)
```
`MERGE` can perform `SET` depending on whether the pattern is MATCHed or CREATed. If it is MATCHed, it will execute the `ON MATCH SET` clause. If it is CREATE-ed, it will execute the `ON CREATE SET` clause.
```
CREATE (:customer {name:'Todd', city:'palo alto'});
MATCH (a:customer)
MERGE (c:city {name:a.city})
ON MATCH SET c.matched = 'true'
ON CREATE SET c.created = 'true';
MATCH (c:city) RETURN properties(c);
properties
----------------------------------------------
{"name": "santa clara", "matched": "true"}
{"name": "san jose", "matched": "true"}
{"name": "san francisco", "matched": "true"}
{"name": "palo alto", "created": "true"}
(4 rows)
```
The default isolation level of the transaction is Read committed on AgensGraph. Therefore, if another transaction executes `MERGE` for the same pattern at the same time, two (or more) identical patterns can be created at the same time. To prevent this, you can execute a transaction at the serializable isolation level as in the example below. If an attempt is made to create the same pattern concurrently, one transaction fails with an error. If you retry a failed transaction, it will behave like `MATCH` instead of `CREATE` because of the pattern created by the succeeded transaction already exists.
```
BEGIN;
SET TRANSACTION ISOLATION LEVEL SERIALIZABLE;
MATCH (a:customer)
MERGE (c:city {name:a.city});
COMMIT;
```
### **Finding the Shortest Path**
The `shortestpath` function can be used to find the shortest path between two vertices. If you want to find all paths, you can use the `allshortestpaths` function.
```
MATCH (p:person {name:'Tom'}), (f:person {name:'Olive'}) CREATE (p)-[:knows]->(f);
MATCH (p1:person {name: 'Tom'}), (p2:person {name: 'Todd'}),
path=shortestpath((p1)-[:knows\*1..5]->(p2)) RETURN path;
```
In this example, we create a ‘knows’ edge path from ‘Tom’ to ‘Olive’. To find the ‘knows’ path from ‘Tom’ to ‘Todd’, we can use the `shortestpath` function. The `shortestpath` function takes a pattern consisting of a start vertex, an edge and an end vertex. We can use a variable length edge expression in the edge to specify if we are looking for a certain degree of connection.
The query results are as follows.
```
[person[3.1]{"name": "Tom"},knows[4.5][3.1,3.5]{},person[3.5]{"name": "Olive"},knows[4.3][3.5,3.6]
{"fromdate": "2015-01-26"},person[3.6]{"name": "Todd"}]
```
### **Finding the Edge (Head, Tail, Last)**
The Head, Tail, Last each functions can be used to find the Edges between two vertices.
```
create (p:person{name:'Tom'})-[e:knows]->(p2:person{name:'Olive'});
create (p:person{name:'Tomas'})-[e:knows]->(p2:person{name:'Todd'});
match (p:person), (p2:person) where p.name='Olive' and p2.name='Tomas'
create (p)-[e:unknows]->(p2);
MATCH (p1:person {name: 'Tom'})-[e\*]->(p2:person {name: 'Todd'}) RETURN e;
MATCH (p1:person {name: 'Tom'})-[e\*]->(p2:person {name: 'Todd'}) RETURN head(e);
MATCH (p1:person {name: 'Tom'})-[e\*]->(p2:person {name: 'Todd'}) RETURN tail(e);
MATCH (p1:person {name: 'Tom'})-[e\*]->(p2:person {name: 'Todd'}) RETURN last(e);
```
In this example, we create a ‘knows’ edge path from ‘Tom’ to ‘Olive’ and ‘Tomas’ to ‘Todd’ and create a ‘unknows’ edge from ‘Olive’ to ‘Tomas’. To find the each path information from ‘Tom’ to ‘Todd’, we can use the `haed`,‘tail’, ‘last’ function. The head function shows the first edge information, the tail function shows the edge information from the second to the last, and the last function shows the last edge information.
The query results are as follows.
```
[e*]
[knows[13.1][12.1,12.2]{},unknows[14.1][12.2,12.3]{},knows[13.2][12.3,12.4]{}]
[head(e)]
knows[13.1][12.1,12.2]{}
[tail(e)]
[unknows[14.1][12.2,12.3]{},knows[13.2][12.3,12.4]{}]
[last(e)]
knows[13.2][12.3,12.4]{}
```
## **Hybrid Query**
### **Introduction**
In this section, we will see how to use SQL and Cypher together in AgensGraph using the following example graph.
Hybrid Query performs aggregation and statistical processing on table and column by using SQL query used in RDB,and the Cypher query used by GDB supports better data query than RDB’s Join operation.
```
CREATE GRAPH bitnine;
CREATE VLABEL dev;
CREATE (:dev {name: 'someone', year: 2015});
CREATE (:dev {name: 'somebody', year: 2016});
CREATE TABLE history (year, event)
AS VALUES (1996, 'PostgreSQL'), (2016, 'AgensGraph');
```
#### **Cypher in SQL**
Since the result of a Cypher query is a relation, you can use a Cypher query in the FROM clause of SQL as if it is a subquery. It is possible to use Cypher syntax inside the FROM clause to utilize a dataset of vertex or edge stored in graph DB as data in a SQL statement.
Syntex :
```
SELECT [column\_name]
FROM ({table\_name|SQLquery|CYPHERquery})
WHERE [column\_name **operator** value];
```
Example :
```
SELECT n->>'name' as name
FROM history, (MATCH (n:dev) RETURN n) as dev
WHERE history.year > (n->>'year')::int;
name
---------
someone
(1 row)
```
#### **SQL in Cypher**
When querying the content of the graph DB with the cypher queries, you can use the match and where clause when you want to search using specific data of the RDB. However, the resulting dataset in the SQL queries must be configured to return a single row of results.
Syntex :
```
MATCH [table\_name]
WHERE (column\_name operator {value|SQLquery|CYPHERquery})
RETURN [column\_name];
```
Example :
```
MATCH (n:dev)
WHERE n.year < (SELECT year FROM history WHERE event = 'AgensGraph')
RETURN properties(n) AS n;
n
-----------------------------------\
{"name": "someone", "year": 2015}
(1 row)
```
# **Graph Data Import**
This section introduces an example to import graph data from external foreign files.
1. Install the extension ‘file\_fdw’ necessary to use the AgensGraph foreign-data wrapper to interface with files on the server’s filesystem.
```
CREATE EXTENSION file\_fdw;
```
2. Create the data import server.
```
CREATE SERVER import\_server FOREIGN DATA WRAPPER file\_fdw;
```
3. Create the foreign table to receive data from a foreign file.
```
CREATE FOREIGN TABLE vlabel\_profile (id graphid, properties text)
SERVER import\_server
OPTIONS( FORMAT 'csv', **HEADER** 'false', FILENAME '/path/file.csv', delimiter E'\t');
```
```
CREATE FOREIGN TABLE elabel\_profile (id graphid, start graphid, "end" graphid, properties text)
SERVER import\_server
OPTIONS( FORMAT 'csv', **HEADER** 'false', FILENAME '/path/file.csv', delimiter E'\t');
```
This ‘properties text’ item should be written differently depending on the situation. See the following example.
```
comments.csv
id|creationDate|locationIP|browserUsed|content|length // This is the HEADER of CSV
2473901162497|1316237219961|77.240.75.197|Firefox|yes|3
2473901162498|1316225421570|213.180.31.8|Firefox|thanks|6
4123168604165|1345407771283|41.203.141.129|Firefox|LOL|3
```
In this case, the schema of foreign table should be as follows.
```
create foreign table comments\_profile
(
id int8,
creationDate int8,
locationIP varchar(80),
browserUsed varchar(80),
content varchar(2000),
length int4
)
server import\_server
options
(
FORMAT 'csv',
HEADER 'true', //Indicates the presence of a csv header
DELIMITER '|',
NULL '',
FILENAME '/path/to/comments.csv'
);
```
4. Execute the import. The data must be cast as type JSONB, since vertices and edges in AgensGraph are stored in the JSONB format.
```
CREATE VLABEL test\_vlabel;
LOAD FROM vlabel\_profile AS profile\_name
**CREATE** (a:test\_vlabel =to\_jsonb(row\_to\_json(profile\_name)));
```
```
CREATE ELABEL test\_elabel;
LOAD FROM elabel\_profile AS profile\_name
MATCH (a:test\_vlabel), (b:test\_vlabel)
WHERE (a).id = to\_jsonb(profile\_name).start AND (b).id = to\_jsonb(profile\_name).end
CREATE (a)-[:test\_elabel]->(b);
```
Following the example above, you can do the following:
```
CREATE VLABEL comments;
LOAD FROM comments\_profile AS ROW
CREATE (:comments =to\_jsonb(row\_to\_json(row)));
```
### **Indexes with Data Import**
The cost of maintaining indexes during bulk insertion is very expensive. AgensGraph provides grammars to toggle indexes by disabling them temporarily and reindexing them later. The disabled indexes do not interfere with bulk inserts. After bulk data import, the option REINDEX LABEL can be used. It will take some time but it is much faster than bulk insert with valid indexes.
```
CREATE VLABEL test\_vlabel DISABLE INDEX;
OR
CREATE VLABEL test\_vlabel;
ALTER VLABEL test\_vlabel DISABLE INDEX;
-- DATA IMPORT
REINDEX VLABEL test\_vlabel;
```
# **Tools**
## **Command Line Interface Tool**
AgensGraph has a command line interface tool called *agens*. Users can query and maintain AgensGraph using *agens* efficiently.
> *Note: agens is based on psql (the PostgreSQL interactive command line tool) so that all features of psql can be used in AgensGraph too.*
### **Meta Command**
```
\dG[+] [PATTERN] list graphs
\dGe[+] [PATTERN] list graph edge labels
\dGl[+] [PATTERN] list graph labels
\dGv[+] [PATTERN] list graph vertex labels
\dGi[+] [PATTERN] list graph property indexes
```
*agens* supports the above additional meta-commands for administration and scripting. You can obtain a list of graph or labels. If + is appended to the command, each object is listed with its associated permissions and description, if any. If *pattern* is specified, only objects whose name match the pattern are listed.
# **Client Drivers**
## **Java Driver**
### **Introduction**
This section briefly explains how to process graph data through AgensGraph to Java application developers.
AgensGraph’s JDBC driver is based on the PostgreSQL JDBC Driver and offers a way for the Java application developer to access the Agensgraph database in their applications. The API of the AgensGraph Java Driver and Postgres JDBC Driver are very similar. The only difference is that AgensGraph uses the Cypher query language instead of SQL and utilizes graph data (vertices, edges and paths) as data types.
### **Usage of the Java Driver**
#### **Get the Driver**
You can download the precompiled driver(jar) from [Download link](http://bitnine.net/downloads) or use maven as follows:
```
...
net.bitnine
agensgraph-jdbc
1.4.2
...
```
You can search the latest version on [The Central Repository with Groupld and Artifactld](http://search.maven.org/#search%7Cgav%7C1%7Cg%3A%22net.bitnine%22%20AND%20a%3A%22agensgraph-jdbc%22).
#### **Connection**
To connect to AgensGraph using the Java Driver, we need two things: the name of the class to be loaded into the Java Driver and the connection string.
- The name of the class is `net.bitnine.agensgraph.Driver`.
- The connection string consists of the sub-protocol, server, port, database.
- `jdbc:agensgraph://`is the sub-protocol to use a particular Driver and a hold value.
- It is written as `jdbc:agensgraph://server:port/database`, including the sub-protocol.
The following code is an example of how to connect AgensGraph. It connects to AgensGraph through the `Connection` object and is ready to be queried.
```
import java.sql.DriverManager;
import java.sql.Connection;
public class AgensGraphTest {
public static void main(String[] args) {
try{
Class.forName("net.bitnine.agensgraph.Driver");
String connectionString = "jdbc:agensgraph://127.0.0.1:5432/agens";
String username = "test";
String password = "test";
Connection conn = DriverManager.getConnection(connectionString, username, password);
...
} catch (Exception e) {
e.printStackTrace();
}
}
}
```
#### **Retrieving Data**
This example illustrates how to export Graph data using MATCH.
A Cypher query is executed using `executeQuery()`. The output is a `ResultSet` object, which is the same output format found in the JDBC driver. The output of the query is the ‘vertex’ type of AgensGraph. Java Driver returns this output as a Vertex instance. Because the Vertex class is a sub-class of `Jsonb`, users can obtain information from the property fields.
```
import java.sql.DriverManager;
import java.sql.Connection;
import java.sql.Statement;
import java.sql.ResultSet;
import net.bitnine.agensgraph.graph.Vertex;
public class AgensGraphTest {
public static void main(String[] args) {
try{
...
Statement stmt = conn.createStatement();
ResultSet rs = stmt.executeQuery(
"MATCH (:Person {name: 'John'})-[:knows]-(friend:Person)" +
"RETURN friend");
while (rs.next()) {
Vertex friend = (Vertex)rs.getObject(1);
System.out.println(friend.getString("name"));
System.out.println(friend.getInt("age"));
} catch (Exception e) {
e.printStackTrace();
}
}
}
}
```
#### **Creating Data**
The following example illustrates how to insert a vertex with a label Person into AgensGraph. Users can input a property of a vertex using strings in Cypher queries. They can also be bound after making a Jsonb.
```
import java.sql.DriverManager;
import java.sql.Connection;
import java.sql.PreparedStatement;
import** net.bitnine.agensgraph.util.Jsonb;
import** net.bitnine.agensgraph.util.JsonbUtil;
public class AgensGraphTest {
public static void main(String[] args) {
...
PreparedStatement pstmt = conn.prepareStatement("CREATE (:person ?)");
Jsonb john = JsonbUtil.createObjectBuilder()
.add("name", "John")
.add("from", "USA")
.add("age", 17)
.build();
pstmt.setObject(1, john);
pstmt.execute();
}
}
```
The following is generated as a string:
```
"CREATE (:Person {name: 'John', from: 'USA', age: 17})"
```
[Reference]
In JDBC, the question mark (?) is the placeholder for the positional parameters of a PreparedStatement. There are, however, a number of SQL operators that contain a question mark. To avoid confusion, when used in a prepared statement, it must be used with spaces.